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B Sequence: 



(Seq. ID No. : 1) 




n=2 

ac aa ac ca ca 



Lookup: aa = w, cc = y, 
ac = x, ca = z 



ac aa ac ca ca = x w x z 



Dataset 1 
35412 



Figure 2 
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| Combine datasets ~| 
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Unrandomise the 
sequence of numbered 
symbols 



Convert the string of 
symbols to characters by 
using a lookup table. 
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Identify sequences of 
interest (eg. start and end 
of gene) using annotation. 
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| HIA1 ~] 
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Name Chromosome Copies Seq.FragID. Start Stop 
HIA1 12 1 94 10 1998 



Dataset 94a Dataset 94b 
3 5 412 xzzxw 


1 




| x3 z5 z4 xl w2 



1 

xl w2 x3 z4 z5 

1 

Lookup: w = aa, y = cc 
x = ac, z = ca 

xwxzz =acaaaccaca 



1 



Sequence: 


Start 


Stop 


acaaaccaca 


10 


1998 



(Seq. ID No. : 1) 



Figure 3 



